A Statistical Study on Persian Subject Headings Development

Authors

M. Tavakolizadeh Ravari, Yazd University

Abstract

Controlled vocabularies have been frequently used in information retrieval systems. Controlling the vocabulary and evaluating the utility of its terms are two critical issues. This research aims at the development of Persian subject headings through statistical analyses. The research was conducted on more than 450,000 records extracted from the electronic version of the National Bibliography of Iran (NBI). The data were processed using data mining techniques. Correlation analysis was performed to determine the relationship between the number of items in NBI and the number of Persian subject headings, as well as between the rank of each subject heading and its frequency of use in NBI. The count of new subject headings versus the count of newly catalogued materials in NBI grew linearly at the beginning and increased logarithmically once the number of catalogued materials reached 3,200. The analysis of the frequency of use of distinct headings within NBI resulted in three classes: most used, frequently used, and normally used subject headings. The findings partly agree with Lancaster's prediction that a controlled vocabulary will grow very fast in the beginning. It was also found that the majority of subject headings are rarely used in NBI. This is due to the absence of a mechanism to control the creation of new headings.
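The two measurements named in the abstract (cumulative growth of distinct headings as records are catalogued, and the rank versus frequency of use of each heading) can be illustrated with a minimal sketch. This is not the author's procedure: the toy records, the function names, and the choice of a Pearson correlation on log-transformed rank and frequency are all assumptions made for illustration only.

```python
import math
from collections import Counter


def heading_growth(records):
    """Cumulative count of distinct headings after each catalogued record."""
    seen = set()
    growth = []
    for headings in records:
        seen.update(headings)
        growth.append(len(seen))
    return growth


def rank_frequency(records):
    """(rank, frequency) pairs, with rank 1 as the most used heading."""
    counts = Counter(h for headings in records for h in headings)
    freqs = sorted(counts.values(), reverse=True)
    return list(enumerate(freqs, start=1))


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


# Hypothetical toy data: each record lists its assigned subject headings,
# in cataloguing order. Real NBI records are not reproduced here.
records = [
    ["Iran--History", "Poetry"],
    ["Poetry"],
    ["Iran--History", "Islam"],
    ["Poetry", "Physics"],
    ["Iran--History"],
    ["Poetry"],
]

growth = heading_growth(records)
ranked = rank_frequency(records)

# Correlate log(rank) with log(frequency); a strongly negative value
# indicates that use frequency falls off steeply with rank.
log_rank = [math.log(r) for r, _ in ranked]
log_freq = [math.log(f) for _, f in ranked]
r = pearson(log_rank, log_freq)

print(growth)
print(ranked)
print(round(r, 3))
```

On real data, plotting `growth` against the record index would show where the curve bends from roughly linear to roughly logarithmic, and thresholds on `ranked` could be used to separate headings into use classes; both the bend point and any thresholds would have to come from the data rather than from this sketch.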


Similar resources

Metab2MeSH: annotating compounds with medical subject headings

SUMMARY Progress in high-throughput genomic technologies has led to the development of a variety of resources that link genes to functional information contained in the biomedical literature. However, tools attempting to link small molecules to normal and diseased physiology and published data relevant to biologists and clinical investigators, are still lacking. With metabolomics rapidly emergi...


Suggesting Subject Headings Using Web Information Sources

We proposed a method that suggests subject headings based on user queries when a pattern-matching algorithm fails to locate subject searches for Online Public Access Catalogs (OPAC). We combined information obtained from Wikipedia, Amazon, and Google for query expansion. Our method has two main advantages: (1) availability for any library without customizing OPACs, and (2) ability to suggest su...


Automated Assignment of Medical Subject Headings

Methods. A test collection of 200 MEDLINE citations published in 1997, with abstracts in English, were selected at random. The following methods of finding and ranking suitable MeSH descriptors have been investigated using this test collection: The Inquery Algorithm. This algorithm depends on parsing text into noun phrases, then using the Inquery search engine to match to MeSH descriptors. Cooc...


Boosting for Text Classification with Subject Headings

The aim of this study is to investigate how Medical Subject Headings (MeSH) as background knowledge source can improve text classification results. The hypothesis is experimented with two different sets of medical documents using HMM-based TC classifier. Experimental results show the improvement of the performance with MeSH in accuracy. ...


A Study of Translation of English Literary Terms into Persian

Abstract: The aim of the present study is to examine the translation of specialized literary terms in order to explore their translatability and the strategies used by three Persian-speaking translators: Siamak Babaei (1386), Sima Dad (1378), and Saeed Sabzian (1384). A further aim of the study is to investigate the word-formation methods used in providing Persian equivalents for literary terms. In line with these aims, the theoretical framework of this study, the translation strategies ...



Journal title:
International Journal of Information Science and Management

Volume 10, Issue 1, pp. 73-88
